AITopics | parametric function

Collaborating Authors

parametric function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Inference Compute-Optimal Video Vision Language Models

Wang, Peiqi, Peng, ShengYun, Zhang, Xuewen, Yu, Hanchao, Yang, Yibo, Huang, Lifu, Liu, Fujun, Wang, Qifan

arXiv.org Artificial IntelligenceMay-27-2025

This work investigates the optimal allocation of inference compute across three key scaling factors in video vision language models: language model size, frame count, and the number of visual tokens per frame. While prior works typically focuses on optimizing model efficiency or improving performance without considering resource constraints, we instead identify optimal model configuration under fixed inference compute budgets. We conduct large-scale training sweeps and careful parametric modeling of task performance to identify the inference compute-optimal frontier. Our experiments reveal how task performance depends on scaling factors and finetuning data size, as well as how changes in data size shift the compute-optimal frontier. These findings translate to practical tips for selecting these scaling factors.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2505.18855

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

PHODCOS: Pythagorean Hodograph-based Differentiable Coordinate System

Arrizabalaga, Jon, Vega, Fausto, ŠÍR, Zbyněk, Manchester, Zachary, Ryll, Markus

arXiv.org Artificial IntelligenceOct-10-2024

This paper presents PHODCOS, an algorithm that assigns a moving coordinate system to a given curve. The parametric functions underlying the coordinate system, i.e., the path function, the moving frame and its angular velocity, are exact -- approximation free -- differentiable, and sufficiently continuous. This allows for computing a coordinate system for highly nonlinear curves, while remaining compliant with autonomous navigation algorithms that require first and second order gradient information. In addition, the coordinate system obtained by PHODCOS is fully defined by a finite number of coefficients, which may then be used to compute additional geometric properties of the curve, such as arc-length, curvature, torsion, etc. Therefore, PHODCOS presents an appealing paradigm to enhance the geometrical awareness of existing guidance and navigation on-orbit spacecraft maneuvers. The PHODCOS algorithm is presented alongside an analysis of its error and approximation order, and thus, it is guaranteed that the obtained coordinate system matches the given curve within a desired tolerance. To demonstrate the applicability of the coordinate system resulting from PHODCOS, we present numerical examples in the Near Rectilinear Halo Orbit (NRHO) for the Lunar Gateway.

algorithm, coordinate system, phodco, (17 more...)

arXiv.org Artificial Intelligence

2410.0775

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Europe > Czechia > Prague (0.04)
(7 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

Neural Term Structure of Additive Process for Option Pricing

Lin, Jimin, Liu, Guixin

arXiv.org Machine LearningAug-2-2024

Providing an arbitrage-free valuation formula and specifying risk-neutral dynamics are essentially two sides of the same coin in option pricing. Yet, the modeling methodology has been leaning towards the latter for decades. That is, the invention of an option pricing model typically starts with proposing a stochastic process that is a martingale for the underlying asset, so that the corresponding risk-neural measure is constructed, and henceforth the arbitrage-free option valuation can be determined either analytically or numerically. Such a methodology was established through the pioneering work of Bachelier [4] and Black and Scholes [9], and since then, almost all of the prevailing models have been invented along this paradigm. The list includes but is not limited to local volatility models by Dupire [17], Cox [14], stochastic volatility models by Heston [20], Hagan et al. [18], Bates [8], jump-diffusion models by Merton [28], Kou [24], and other models built upon Lévy processes by Madan et al. [26], Barndorff-Nielsen [7]. Nonetheless, the reverse approach, which first provides an arbitrage-free valuation formula as in Carr and Madan [11], Davis and Hobson [15] and then finds the underlying martingale supporting the formula, is still possible, as noted in [21, 27]. In recent work, Carr and Torricelli [12] starts with one particular pricing formula that yields logistically distributed marginals. Although there is no underlying Lévy process that produces such marginals, by allowing the increment to be nonstationary, an additive logistic process can be constructed to support that pricing formula.

neural term structure, term structure, volatility surface, (14 more...)

arXiv.org Machine Learning

2408.01642

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.40)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures

Viallard, Paul, Emonet, Rémi, Habrard, Amaury, Morvant, Emilie, Zantedeschi, Valentina

arXiv.org Machine LearningFeb-19-2024

In statistical learning theory, a generalization bound usually involves a complexity measure imposed by the considered theoretical framework. This limits the scope of such bounds, as other forms of capacity measures or regularizations are used in algorithms. In this paper, we leverage the framework of disintegrated PAC-Bayes bounds to derive a general generalization bound instantiable with arbitrary complexity measures. One trick to prove such a result involves considering a commonly used family of distributions: the Gibbs distributions. Our bound stands in probability jointly over the hypothesis and the learning sample, which allows the complexity to be adapted to the generalization gap as it can be customized to fit both the hypothesis class and the task.

equation, generalization gap, hypothesis, (14 more...)

arXiv.org Machine Learning

2402.13285

Country:

Europe > Spain (0.04)
Europe > France > Brittany > Ille-et-Vilaine > Rennes (0.04)

Genre: Research Report (1.00)

Industry: Information Technology (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

Learnable Subspace Clustering

Li, Jun, Liu, Hongfu, Tao, Zhiqiang, Zhao, Handong, Fu, Yun

arXiv.org Machine LearningApr-9-2020

This paper studies the large-scale subspace clustering (LSSC) problem with million data points. Many popular subspace clustering methods cannot directly handle the LSSC problem although they have been considered as state-of-the-art methods for small-scale data points. A basic reason is that these methods often choose all data points as a big dictionary to build huge coding models, which results in a high time and space complexity. In this paper, we develop a learnable subspace clustering paradigm to efficiently solve the LSSC problem. The key idea is to learn a parametric function to partition the high-dimensional subspaces into their underlying low-dimensional subspaces instead of the expensive costs of the classical coding models. Moreover, we propose a unified robust predictive coding machine (RPCM) to learn the parametric function, which can be solved by an alternating minimization algorithm. In addition, we provide a bounded contraction analysis of the parametric function. To the best of our knowledge, this paper is the first work to efficiently cluster millions of data points among the subspace clustering methods. Experiments on million-scale datasets verify that our paradigm outperforms the related state-of-the-art methods in both efficiency and effectiveness.

parametric function, representation, subspace, (15 more...)

arXiv.org Machine Learning

2004.0452

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

How to use Chainer for Theano users

@machinelearnbotOct-16-2017, 14:46:00 GMT

As we mentioned on our blog, Theano will stop development in a few weeks. Many aspects of Chainer were inspired by Theano's clean interface design, so we would like to introduce Chainer to users of Theano. We hope this article assists interested Theano users to move to Chainer easily. First, let's summarize the key similarities and differences between Theano and Chainer. In this post, we assume that the modules below have been imported.

artificial intelligence, chainer, machine learning, (15 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback